PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG014768t1
Common NameTCM_014768
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family LFY
Protein Properties Length: 392aa    MW: 43761.4 Da    PI: 7.2518
Description LFY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG014768t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1FLO_LFY642.46.7e-19613771386
           FLO_LFY   1 mdpeafsas.lfkwdpraaaaapparlleeaavseapleaaaaaaarklr......eleelfkayGvryltvakiaelGftvstLvdmkdeel 86 
                       mdpeaf++  +fkwdpr ++a++parl+e  a  ++p++aaa+aaa+  r      ++eelf+ayG+ry+t+akiaelGftvstL++mk+eel
  Thecc1EG014768t1   1 MDPEAFTTGgFFKWDPRGVVAPTPARLMEAVAP-PQPQTAAAVAAAYMGRaprelgGIEELFQAYGIRYYTAAKIAELGFTVSTLLGMKEEEL 92 
                       9****98876**********9999999866655.55544445455544444567779************************************ PP

           FLO_LFY  87 ddlmkslseifrldllvGeryGikaavraerrrleeeeaekkrrkllsedeetaldalsqeglseepvqeekeaagsggeglgeaelvaaeek 179
                       d++m+s+s+ifr++llvGeryGikaavraerrrleee+  ++rr+l+s d+++aldalsqeglseepvq+ekeaagsgg+g  ++e+v+a   
  Thecc1EG014768t1  93 DEMMNSVSQIFRWELLVGERYGIKAAVRAERRRLEEED--SRRRHLVSGDTTNALDALSQEGLSEEPVQQEKEAAGSGGGG--TWEVVVAGG- 180
                       ************************************88..99999********************************9999..999999983. PP

           FLO_LFY 180 kseeekkkaskkkqkrkkkkelkseededeeeeededeegsgedgeerqrehPfivtepgevargkknGLDYLfdLyeqCrefLlqvqkiake 272
                            +kk+    ++rk  k+ + e d+++e e  ed+e++  +g erqrehPfivtepgevargkknGLDYLf+LyeqCrefL+qvq+iake
  Thecc1EG014768t1 181 -----RKKQ----RRRKGPKK-VVEVDNEDELEGAEDDENGDIGGYERQREHPFIVTEPGEVARGKKNGLDYLFHLYEQCREFLIQVQNIAKE 263
                       .....2333....33344444.344555666666667777888899*********************************************** PP

           FLO_LFY 273 rGekcPtkvtnqvfryakkagasyinkPkmrhYvhCYalhcLdeeasnalrrafkergenvGawrqacykplvaiaarqgwdidavfnahprL 365
                       rGekcPtkvtnqvfryakkagasyinkPkmrhYvhCYalhcLdeeasnalrrafkergenvGawrqacykplvaiaarqgwdida+fnah rL
  Thecc1EG014768t1 264 RGEKCPTKVTNQVFRYAKKAGASYINKPKMRHYVHCYALHCLDEEASNALRRAFKERGENVGAWRQACYKPLVAIAARQGWDIDAIFNAHRRL 356
                       ********************************************************************************************* PP

           FLO_LFY 366 siWYvPtkLrqLChlerskas 386
                       +iWYvPtkLrqLCh+er++a+
  Thecc1EG014768t1 357 AIWYVPTKLRQLCHAERNNAA 377
                       ******************986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF016983.2E-2051377IPR002910Floricaula/leafy protein
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0010077Biological Processmaintenance of inflorescence meristem identity
GO:0010582Biological Processfloral meristem determinacy
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0031490Molecular Functionchromatin DNA binding
GO:0042803Molecular Functionprotein homodimerization activity
GO:0043565Molecular Functionsequence-specific DNA binding
GO:0043621Molecular Functionprotein self-association
Sequence ? help Back to Top
Protein Sequence    Length: 392 aa     Download sequence    Send to blast
MDPEAFTTGG FFKWDPRGVV APTPARLMEA VAPPQPQTAA AVAAAYMGRA PRELGGIEEL  60
FQAYGIRYYT AAKIAELGFT VSTLLGMKEE ELDEMMNSVS QIFRWELLVG ERYGIKAAVR  120
AERRRLEEED SRRRHLVSGD TTNALDALSQ EGLSEEPVQQ EKEAAGSGGG GTWEVVVAGG  180
RKKQRRRKGP KKVVEVDNED ELEGAEDDEN GDIGGYERQR EHPFIVTEPG EVARGKKNGL  240
DYLFHLYEQC REFLIQVQNI AKERGEKCPT KVTNQVFRYA KKAGASYINK PKMRHYVHCY  300
ALHCLDEEAS NALRRAFKER GENVGAWRQA CYKPLVAIAA RQGWDIDAIF NAHRRLAIWY  360
VPTKLRQLCH AERNNAAASS SVSGGPDHMA F*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2vy2_A1e-1102153752162PROTEIN LEAFY
2vy1_A1e-1102153752162PROTEIN LEAFY
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1180186RKKQRRR
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00095SELEXTransfer from AT5G61850Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankDQ1497251e-140DQ149725.1 Theobroma cacao LEAFY (LEAFY) mRNA, partial cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007038161.10.0Floricaula/leafy
SwissprotO040640.0FLLH_POPTR; Floricaula/leafy homolog
TrEMBLA0A061G0H30.0A0A061G0H3_THECC; Floricaula/leafy
STRINGPOPTR_0015s11820.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM79072841
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G61850.11e-160floral meristem identity control protein LEAFY (LFY)
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]